ALMOST SURE INVARIANCE PRINCIPLE FOR DYNAMICAL 
SYSTEMS BY SPECTRAL METHODS 



SEBASTIEN GOUEZEL 



Abstract. We prove the almost sure invariance principle for stationary M'*- 
valued processes (with dimension- independent very precise error terms), solely 
under a strong assumption on the characteristic functions of these processes. 
This assumption is easy to check for large classes of dynamical systems or Markov 
chains, using strong or weak spectral perturbation arguments. 



The almost sure invariance principle is a very strong reinforcement of the cen- 
tral limit theorem: it ensures that the trajectories of a process can be matched 
with the trajectories of a brownian motion in such a way that, almost surely, the 
error between the trajectories is negligible compared to the size of the trajectory 
(the result can be more or less precise, depending on the specific error term one 
can obtain). This kind of results has a lot of consequences, see e.g. |MN09j and 
references therein. 

Such results are well known for one-dimensional processes, either independent 
or weakly dependent (see among many others |DP84t[HK82] ). and for independent 
higher dimensional processes |Ein89t[Zai98] . However, for weakly dependent higher 
dimensional processes, difficulties arise since the techniques relying on Skorokhod 
representation theorem do not work efficiently. In this direction, an approxima- 
tion argument introduced by |BP79j was recently generalized to a large class of 
weakly dependent sequences in |MN09j : their results give explicit error terms in 
the vector- valued almost sure invariance principle, and are applicable when the 
variables under study can be well approximated with respect to a suitably chosen 
filtration. In particular, these results apply to a large range of dynamical systems 
when they have some markovian behavior and sufficient hyperbolicity. 

Unfortunately, it is quite common to encounter dynamical systems for which 
there is no natural well-behaved filtration. It is nevertheless often easy to prove 
classical limit theorems, by using another class of arguments relying on spectral 
theory: these arguments automatically yield a very precise description of the char- 
acteristic functions of the process under study, thereby implying limit theorems. 
It is therefore desirable to develop an abstract argument, showing that enough 
control on the characteristic functions of a process implies the almost sure in- 
variance principle, for vector-valued observables. This is our goal in this paper. 
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[BP791 Theorem 5] gives such a result, but its assumptions are too strong for the 
apphcations we have in mind. Moreover, even when the previous approaches are 
apphcable, our method gives much sharper error terms. 

We will state our main probabilistic result. Theorem II. 2 ^ in the next paragraph, 
and describe applications to dynamical systems and Markov chains in Section [2j 
The remaining sections are devoted to the proof of the main theorem. 

1. Statement of the main result 

For d > 0, let us consider an R'^-valued process {Aq, Ai, . . . ), bounded in for 
some p > 2. Under suitable assumptions to be introduced below, we wish to show 
that it can be almost surely approximated by a brownian motion. 

Definition 1.1. For A G (0, 1/2] and a (possibly degenerate) symmetric semi- 
positive- definite d X d matrix, we say that an Mf^ -valued process {Aq, Ai, . . .) sat- 
isfies an almost sure invariance principle with error exponent A and limiting co- 
variance if there exist a probability space Q and two processes {A^, A^, . . . ) and 
{Bq, Bi, . . . ) on Q such that 

(1) The processes {Aq, Ai, . . . ) and {Aq, Al, . . . ) have the same distribution. 

(2) The random variables Bq, Bi, . . . are independent, distributed as Af{0, S^). 

(3) Almost surely in Q, 

n— 1 n—1 

(1-1) 

A brownian motion at integer times coincides with a sum of i.i.d. gaussian vari- 
ables, hence this definition can also be formulated as an almost sure approximation 
by a brownian motion, with error o(n^). 

Under some assumptions on the characteristic function of {Aq, Ai, . . .), we will 
prove that this process satisfies an almost sure invariance principle. To simplify 
notations, for t and x G W^, we will write e**^ instead of e'^^^'^K 

Let us state our main assumption ([H]), ensuring that the process we consider is 
close enough to an independent process: there exist eo > and C,c> such that, 
for any n,m > 0, any 6i < 62 < ■ ■ ■ < ^n+m+i, any A; > and any ti, . . . , tn+m ^ 
with \tj\ < eo, we have 

E 



< C(l + max - 6j-|)^("+'")e~^^ 

This assumption says that, if one groups the random variables into n + m blocks, 
then a gap of size k between two blocks gives characteristic functions which are 
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exponentially close (in terms of k) to independent characteristic functions, with 
an error which is, for each block, polynomial in terms of the size of the block. This 
control is only required for Fourier parameters tj close to 0. 

Of course, it is trivially satisfied for independent random variables. The inter- 
esting point of this assumption is that it is also very easy to check for dynamical 
systems when the Fourier transfer operators are well understood, see Theorem 12.11 
below. 

Our main theorem follows. 

Theorem 1.2. Let {Aq, Ai, . . .) be a centered -valued stationary process, in 
for some p > 2, satisfying ([H]). Then 

(1) The covariance matrix cov(^"rJ Ai)/n converges to a matrix E^. 

(2) The sequence X]"=o ^e/V^ converges in distribution to A/'(0, S^). 

(3) The process {Aq, Ai, . . .) satisfies an almost sure invariance principle with 
limiting covariance S^, for any error exponent 



When p = oo, the condition on the error becomes A > 1/4, which is quite good 
and independent of the dimension: this condition A > 1/4 had previously been 
obtained only for very specific classes of dynamical systems (in particular closed 
under time reversal), for real-valued observables (see e.g. |FMT03t IMT04j ). 

If the process is not stationary, we need an additional assumption to ensure the 
(fast enough) convergence to a normal distribution: 

Theorem 1.3. Let [Aq, Ai, . . . ) be an -valued process, bounded in for some 
p > 2, satisfying ([H]). Assume moreover that < ^> ^''^^ ^^^^ there 

exists a matrix such that, for any a > 0, 



uniformly in m,n. Then the sequence Yll=o ^i/V^ converges in distribution to 
A/'(0, S^). Moreover, the process {Aq, Ai, . . .) satisfies an almost sure invariance 
principle with limiting covariance S^, for any error exponent A > p/{^p — 4). 

Theorem 11.21 is in fact a consequence of Theorem II. 3^ since we will prove in 
Lemma 12.71 that a stationary process satisfying ([H]) always satisfies fll.3p (even 
more, this inequality holds with a = 0). 

Contrary to the results of [BP79j . our results are dimension- independent for 
i.i.d. random variables (but they are not optimal in this case, see |Ein89t IZai98[ 
IZai07] : for i.i.d. sequences in L^, 2 < p < oo, the almost sure invariance principle 
holds for any error exponent A > 1/p). 

In this paper, C will denote a positive constant whose precise value is irrelevant 
and may change from line to line. 



(1.2) 



A > 



1 1 



4p - 4 4 (4p - 4) ' 



(1.3) 




INVARIANCE PRINCIPLE VIA SPECTRAL METHODS 



4 



2. Applications 



2.1. Coding characteristic functions. Let us consider first a very simple ex- 
ample: let T(x) = 2x mod 1 on the circle = M/Z, and consider a Lipschitz 
function f : ^ of vanishing average for Lebesgue measure. We would like to 
prove an almost sure invariance principle for the process {f{x), f{Tx), f{T'^x), . . . ), 
where x is distributed on according to Lebesgue measure. Define an operator 
Ct on Lipschitz functions by Ctu{x) = X]t{j/)=x It is then easy to 

check that, for any to, ... , tn-i in M'^, 



Using the good spectral properties of the operators Ct, it is not very hard to show 
that this implies (|H|) . 

In more complicated situations, it is often possible to encode in the same way 
the characteristic functions of the process under study into a family of operators. 
However, these operators may act on complicated Banach spaces (of distributions, 
or measures). It is therefore desirable to introduce a more abstract setting that 
encompasses the essential properties of such a coding, as follows 

Consider an M'^-valued process {Aq, Ai, . . .). Let i3 be a Banach space and let 
Ct (for t G M'^, \t\ < eo) be linear operators acting continuously on B. Assume that 
there exist uq E B and G B' (the dual of B) such that, for any to, ... , tn-i G M"^ 
with \tj\ < eo. 



In this case, we say that the characteristic function of {Ao,Ai, . . . ) is coded by 

{B, {Ct)\t\<eo,UO,^o)- 

We claim that the assumption ([H]) follows from suitable assumptions on the 
operators Ct, that we now describe. 

(11) One can write Co = U + Q where 11 is a one-dimensional projection and Q 
is an operator on B, with QII = HQ = 0, and HQ^Hg^^? < Ck"" for some 
K < 1. 

(12) There exists C > such that ||>C"||g^g < C for all n G N and all small 
enough t. 

We will denote this set of conditions by (I). 

Theorem 2.1. Let (Ai) be a process whose characteristic function is coded by a 
family of operators (Ct), and which is bounded in for some p > 2. Assume that 
(I) holds. Then there exist a G M'^ and a matrix such that (X]"=o^ ~ '^'^) / 
converges to Af{0, S^). Moreover, the process {Ai — a)^gN satisfies an almost sure 
invariance principle with limiting covariance for any error exponent larger than 



(2.1) 




(2.2) 




p/(4p - 4). 
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The proof will exhibit a as the limit of E{Aii), give a formula for S^, and derive 
the theorem from Theorem 11.31 since flHj) and fll.3p follow from (I). Even better, 
we have under the assumptions of Theorem 12.11 



This is proved in Lemma [2.71 below. 

Remark 2.2. Let us stress that the assumptions of this theorem are significantly 
weaker than those of similar results in the literature: we do not require that a 
perturbed eigenvalue has a good asymptotic expansion, nor even that such an 
eigenvalue exists. In particular, to the best of the author's knowledge, the central 
limit theorem was not known under the assumptions of Theorem 12.11 

Before we prove Theorem 12. H at the end of this section, let us describe some 
applications. We explain how to check (I) in several practical situations. Let 
T : X — )■ X be a dynamical system, let /i be a probability measure (invariant or 
not), and let / : X — )■ W^. We want to study the process (/, / o T, / o T^, . . . ). 

2.2. Strong continuity. Assume that the characteristic function of the process 
(/, / o T, / o T^, . . . ) can be coded by a family of operators Ct on a Banach space 
and that the operator Co satisfies (II), i.e., it has a simple eigenvalue at 1, the 
rest of its spectrum being contained in a disk of radius /t < 1 (such an operator is 
said to be quasicompact) . 

Proposition 2.3. If the family Ct : B ^ B depends continuously on the parameter 
t at t = 0, then (12) is satisfied. 

Proof. By classical perturbation theory, the spectral picture for Cq persists for 
small t: we can write Ct = X{t)Ilt + Qt where X{t) G C, 11^ is a one-dimensional 
projection and < Ck" for some k < 1, uniformly for small t. If |A(t)| < 1 

for small t, we obtain (12). 
For small t, we have 



When t — i- 0, by continuity, the quantity (^o; ^t^o) converges to (^o; Hmq) = 1 (see 
(12. 8 p below). In particular, for small enough t, {^o,IltUo) ^ 0. Since the right 
hand side of (12. 4 p is bounded by 1, this gives |A(t)| < 1, concluding the proof. □ 

Let us be more specific. Let T be an irreducible aperiodic subshift of finite 
type, let m be a Gibbs measure, and let / : X — M'^ be Holder continuous with 
/ / dm = 0. Let £ be the transfer operator associated to T, defined by duality by 
/ u-voT dm = f Cu-v dm, and define perturbed operators Ct by Ctiu) = C{e^^^u). 
These operators code the characteristic function of the process (/, / o T, . . . ) and 



(2.3) 




E{e- 



= X{tn^o,TltUo)+0{K-). 



(2.4) 
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depend analytically on t (this follows from the series expansion e*^ = 
and the fact the Holder functions form a Banach algebra). The condition (I) is 
checked e.g. in |GH88j . |PP90j . Hence, Theorem 12.11 gives an almost sure invariance 
principle for any error exponent > 1/4. This result is not new, it is already given 
in |MN09] . though with a weaker error term. 

If T is an Anosov or Axiom A map and / : X — )■ M"^ is Holder continuous, 
then the same result follows using symbolic dynamics. One can also avoid it and 
directly apply Theorem 12.11 to the transfer operator acting on a Banach space B 
of distributions or distribution-like objects, as in |BT07t [GL08] . 

Let now T : X ^ X he a piecewise expanding map, and assume that the 
expansion dominates the complexity (in the sense of |SauOOt Lemma 2.2]) - this 
setting includes in particular all piecewise expanding maps of the interval, since 
the complexity control is automatic in one dimension. Let / : X — t- M'^ be /3- 
Holder continuous for some f3 e (0, 1]. Then the perturbed transfer operator Ct 
acts continuously on the Banach space = introduced in [SauOOj . and depends 
analytically on t (since S is a Banach algebra). With Theorem 12. ![ we get an 
almost sure invariance principle for any error exponent > 1/4. This result was 
only known for dim(X) = 1 and d = 1, thanks to |HK82j . 

This result also applies to coupled map lattices, since |BGK07j shows (I) for 
such maps. Let us mention that the Banach space B here is not a Banach space 
of functions or distributions, but this is of no importance for our abstract setting. 

Assume now that T is the time-one map of a contact Anosov flow. [Tsu08j 
constructs a Banach space of distributions on which the transfer operator C acts 
with a spectral gap. If / is smooth enough, then Ct '■= jC(e**-^-) depends analytically 
on t. We therefore obtain an almost sure invariance principle for any error exponent 
> 1/4. This result was known for real- valued observables |MT02j . but is new for 
M'^-valued observables. However, our method does not apply to the whole class of 
rapid-mixing hyperbolic flows, contrary to the martingale arguments of [MT02j . 

Finally, assume that T : X — )■ X is a mixing Gibbs-Markov map with invariant 
measure m, i.e., it is Markov for a partition a with infinitely many symbols, and 
has the big image property and Holder distortion (this is a generalization of the 
notion of subshift of finite type to infinite alphabets, see e.g. |MN091 Section 3.1] for 
precise definitions). For / : X — )■ M*^ and a G a, let Df{a) denote the best Lipschitz 
constant of / on a. Consider / of zero average, such that J2a€a m{a)Df{a)P < oo, 
for some p G (0, 1] (this class of observables is very large, it contains in particular 
all the weighted Lipschitz observables of |MN09l Section 3.2]). 

Theorem 2.4. // / G for some p > 2, then the process (/, / o T, . . . ) satisfies 
an almost sure invariance principle for any error exponent > p/ {Ap — 4). 

This follows from |Gou08t Section 3.1], where a Banach space B satisfying the 
assumptions of Proposition 12.31 is constructed. 

Let us mention that the almost sure invariance principle is invariant under the 
process of inducing, i.e., going from a small dynamical system to a larger one. A lot 
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of hyperbolic dynamical systems can be obtained by inducing from Gibbs-Markov 
maps, and the previous theorem implies an almost sure invariance principle for all 
of them (see [MN09j for several examples). 

Remark 2.5. In such dynamical contexts (when the measure is invariant and 
ergodic), the matrix is degenerate if and only if / is an coboundary in 
some direction. Indeed, if is degenerate, it follows from (12. 3p that there is a 
nonzero direction t such that (t, S'„/) is bounded in L^. By Leonov's Theorem 
(see e.g. jAWOOj ). this implies that {t, f) is an coboundary: there exists u E L"^ 
such that (t, /) = u — u oT almost everywhere. Conversely, this condition implies 
that T? is degenerate. 

2.3. Weak continuity. In several situations, the strong continuity assumptions 
of the previous paragraph are not satisfied, while a weaker form of continuity holds. 
We describe such a setting in this paragraph. 

Assume again that the characteristic function of a process (/, / o T, / o T^, . . . ) 
is coded by a family of operators Ct on a Banach space B, and that the operator 
£o satisfies (II), i.e., it is quasicompact with a simple dominating eigenvalue at 1. 

We do not assume that the map t h- >■ £t is continuous from a neighborhood 
of to the set of linear operators on S, hence classical perturbation theory does 
not apply. Let C be a Banach space containing B on which the operators Ct act 
continuously, and assume that there exist M > 1, k < 1 and C > such that 

(1) For all n G N and \t\ < cq, we have H^rllc^c ^ 

(2) For all n G N, all \t\ < eo and all u e B, we have < C/t" \\u\\q + 
CM" IImII^. 

(3) The quantity — i2o|le^c tends to when t — )■ 0. 

Then |KL99t [Liv03j show that, for small enough t, the operator Ct acting on B has 
a simple eigenvalue \(t) close to 1, and Ct can be written as \(t)Ilt + Qt where lit 
is a one- dimensional projection and, for some C > and k < 1, 

llHtllg^g < C, ligrile^s < Ck"", \\Ut - nilg^c ^ when t 0. 
Therefore, (12) follows from the arguments in the proof of Proposition 12.31 if we 
can prove that (^o, ^tUo) {^o, Hmq) when t — > 0. By the last estimate in the 
previous equation, this is true if belongs not only to B' but also to C, which is 
usually the case. 

2.4. Markov chains. Consider a Markov chain Xo,Xi, . . . (with an initial mea- 
sure /i, and a stationary measure m possibly different from yu), on a state space X. 
Let also / : A* — )■ M with Em{f) = 0, we want to study the process Ae = f{Xi). 

Denote by P the Markov operator associated to the Markov chain, and define a 
perturbed operator Ptiu) = P{e^^^u). Then 

^^(e^EL-o^t.^.) = ^^(e*ELt*^/(^^) . E(e**"-^(^"-)|X„_2)) 
= i^^(e*ELt*./(^.)p,_j(X„_2)). 
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By induction, we obtain 



(2.5) 



This is very similar to the coding property introduced in fl2.2p . the small difference 
being that the composition is made in the reverse direction. In particular, the 
proof of Theorem 12.11 still works in this context. We obtain the following result: 

Proposition 2.6. Let B be a Banach space of functions on X such that 1 & B, 
and the integration against /i is continuous on B. If the operators Pt satisfy on 
B the condition (I), then the process f{X^) satisfies ([H]). If f{Xi) is hounded in 
LP for some p > 2, it follows that the process (/(X^)) satisfies an almost sure 
invariant principle, for any error exponent X > p/ (4p — 4). 

To check the condition (I) , the arguments of Paragraph 12.21 or 12.31 can be ap- 
plied (if the Banach space B is carefully chosen, depending on the properties of 
the random walk under study). We refer in particular the reader to the article 
|HP08] . where several examples are studied, including uniformly ergodic chains, 
geometrically ergodic chains, and iterated random Lipschitz models. It is in par- 
ticular shown in this article that the weak continuity arguments of Paragraph 12.31 
are very powerful in some situations where the strong continuity of Paragraph 12.21 
does not hold. 

2.5. Proof of Theorem 12.11 assuming Theorem 11.31 

First step: there exists Ui E B such that, for to, ... , t^-i £ B{0, eo), 

(2.6) n(£t„_, ■ ■ ■ Ct,,uo) = (^0, ■ ■ • Ct^uo)ui. 

Since 11 is a rank one projection, we can write Il{u) = {^2, u)u2 for some U2 E B 
and ^2 G with {^2,^2) = 1. The trivial equality 



Let u = Ct^_^ ■ ■ ■ CtgUQ. When tends to infinity, £q tends to 11. Hence, letting 
N tend to 00 in the previous equality, we get 




>n + JV-l 




gives, using the coding by the operators Ct, 




(2.8) (^0, uo) = (^0, nno) = lim(eo, C^o) = limE(e^^^-"o o-^^) = 1. 



Taking u = uq in (12.71) . this implies in particular {^o,U2) 7^ 0. Finally, 

n(M) = (6,m)m2 = (^0,m)m2/(Co,M2). 

We obtain (12.61) for ui = U2/ {C,o,U2). 



INVARIANCE PRINCIPLE VIA SPECTRAL METHODS 



9 



Second step: (|H|) holds. 

Consider 61 < ■ ■ ■ < bn+m+i, as well as ti, . . . , tn+m ^ -8(0, eg) and k > 0. Then 



(2.9) 



/'''n+m + l— fen + m + ''n+1 /'fc r'bn+l—bn r<b2—bl fbl 

SO, ■ ■ ■ ''-O'^-tn ■ ■ ■ ''-ii ''-O "0 

C £bn+m + l—bn + m £bn + 2—bn+\ / £k -Q\^fen + 1— C}'^^^^ C!'^ 11 

All the operators satisfy ||>^tj|g^g ^ C*. Since ||£q — n||g_^^ < Ck^ for some 
K < 1, it follows that the term on the line before the last one in (I2.9p is bounded 
by C^^'^K^. Moreover, by ( I2.6p . the term on the last line is 



C r'bn+m+l—bn+m r'bn+2—bn+i \ fen nbi—b\ r-b\ ^ \ 

-1 



The second factor in this equation is simply E \ e V ^ / . Moreover 





hi — ^n+J7 


■ ■ ■ ^i„+i ^0 Wo; 






,2-fe„+i-^^^^ + 0(C™k''"+i+'= 




■■4": 





Therefore, the last line of (12. 9 p is equal to 



We have proved that a difference to be estimated to check ((Hj) is bounded by 
(jn+m^k _^ (jm^b^+i+k ^Qxae C > I and k < 1. Write C = 2*^' and n = e"" for 
some c, C" > 0, then this is error is at most 

2 ■ 2^'("+'™)e-"'^ < 2 ■ (1 + max - 6^- 1)^' 6""^ 

This proves ( iHj) . 



Third step: there exist a G M'^ and C,6 > such that \E{Af:) — a\ < Ce~^^. 

Working component by component, we can without loss of generality work with 
one- dimensional random variables. 

Enriching the probability space if necessary, we can construct a centered ran- 
dom variable V, independent of all the A^, belonging to L^, whose characteristic 
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function is supported in 5(0, eo) (see Proposition 13.81 for the existence of V). Let 
also T > 0. Then 

E{Ae) = E{Ae + V) = E{{Ae + V)1ia,+v\>t) + [ x dPA.+v 

J\x\<T 

The first term is bounded by \\Ae + V\\^2 ||l|A,+y|>T||^2 < CP{\Ae + V\ > Tf'^ < 
CjT^I'^. Let <\)i{t) = E(e**^*)E(e**^) be the characteristic function of Ai + V. Let 
qt be the Fourier transform of x1\x\<t- Since the Fourier transform on R is an 
isometry up to a constant factor ci, we have f^^^^j-x dPA^+v = ci / hence 

We have 

Mt) = (eo, CtC',uo)Eie'''') = (^0, AnMo)i?(e**^) + (^o, A(4 - Il)uo)Eie'''') 

The function ip is independent of i, while the function ri{t) depends on i, is 
bounded by Ck^ and is supported in {\t\ < eo}. We obtain 



E{Ae) = cijg^ij + cij g^r, + 0{T-^'^) 

The norm of gx is equal to C \\x1ix\<t\\i^2 = CT^^'^, we therefore obtain 

E{A,) = c^jwi^ + 0{k'T'/') + 0(T-V2). 

Consider now k,i gN. Taking T = /--min(fc/)/3^ obtain for some S > 

\E{Ae) - E{Ak)\ < Ce-^™'^^*^'^^ 

This shows that the sequence E{Ae) is a Cauchy sequence, therefore converging to 
a limit a. Moreover, letting /c — )■ oo, it also yields \E{Ae) — a\ < Ce~^^ as desired. 

Fourth step: conclusion of the proof. 

We claim that, for any m G N, there exists a matrix Sm such that, uniformly in 

i, m, 

(2.10) I cov(A,, Ae+m) -Sm\< Ce-''. 

Since the proof is almost identical to the third step, it will be omitted. 

Lemma 2.7. Let (Ai) be a process bounded in for some p > 2, satisfying 
(iHj) . and fl2.10p for some sequence of matrices Sm- Then the series = sq + 
'Y^^=i{^rn + s^) converges in norm and, uniformly in m,n, 

/ m+n— 1 

(2.11) cov 



5^ A, - nS^ 



l=in 



< c. 
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Let us admit this lemma for the moment. Then the process {Ai — a) satisfies all 
the assumptions of Theorem 11.31 Theorem 12.11 follows from this theorem. □ 



Proof of Lemma 2/1 . Let us first prove that, for some 5 > 0, 



(2.12) 



cov(v4£, < Ce 



~5m 



We will assume that d = 1 to simplify the notations. The estimate f l2.12p follows 
easily from the techniques we will develop later in this paper, but we will rather 
give a direct elementary proof. Let V, V be two independent random variables as 
in the third step of the previous proof. Then 

E{AiA,+^) = EiiAe + V){A,+m + V')) = j xy dP(x,y), 

where P is the distribution of {An + V, A^+m + V). For T > 0, we have 

j \xy\l\^\>T AP{x,y) = E{\A, + V\\A,+^ + V'\l\A, 



-^V\>T) 



< WAp 



V\ 



LP 



\Ae+m + V'\\^2 ||l|A|+V'|>T||^q , 



\Af+V\>T\ 



LI 



where g > 1 is chosen so that 1/p + 1/2 + 1/q = 1. Moreover, 
Pi\Ai + V\ > r)^/9 < CT-^/'>. We have proved that, for some p > 0, we have 
^ \xy\l\x\>T dP(x,?/) < CT~P. In the same way, ^ \xy\l\y\^T dP{x,y) < CT~p. 
Therefore, 



E{Ai,Ai,+^) = j xyli^iiyi<T dP{x, y) + 0{T~ 



The characteristic function (p of {Ap + V, A^+m + V) is given by 

it is therefore supported in \u\ < eo}. Denoting by hj- the Fourier transform 
of the function xyl\x\^\y\<Ti and using the fact that the Fourier transform is an 
isometry up to a constant factor C2 = c\, we get 



E{A,A 



e+m) 



C2 / hT(t) + 0{T- 



Letting ip(t,u) = P(e**^^)E(e*"^ 
that 



'^+'" )i?(e**^)i?(e*"^'), a similar computation shows 
E{Ae)E{A,+J =C2 !h^iP + 0{T-p). 



Therefore, 

\E{A,A, 



E{A,)E{A,+^)\ = C2 

<C\\hT\\L2 



lL2 



0{T-P) 
0{T-P) 
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COV 



The function (p—ip is supported in {\t\, \u\ < eo}, and ([H]) implies that it is bounded 
by Ce~^"^ for some c > 0. Moreover, ||/it||2,2 = C \\xylix\,\y\<T\\j^2 ^ CT^. Finally, 
we obtain 

\E{AeAe+m) - E{Ae)E{Ae+m)\ < Ce'^'^T^ + CT^'. 
Choosing T = e"™/^ this gives f l232|) . 

When £ — )■ oo, cov(y4£, Ai+m) tends to Sm by assumption. Therefore, letting i 
tend to infinity in fl2.12p . we get < Ce~^"^. From fl2.10p . we obtain 

(2.13) I cov(A,, - s„,\ < Cmm{e-'',e-'"'). 

We have 

(m+n—l \ n— 1 

I = ^ COv(y4j+„) 
e=m J i=0 

+ ^ ] (C0v(y4j+m, ^j+m) + COv(y4j_|_m, ^j+m)*)- 
0<i<j<n— 1 

With f l27[3|) . we get 

''m+n— 1 \ n— 1 

£=m / i=0 0<i<j<n-l 

<CJ2 e"'^('+'"^ +C Yl min(e-^(*+^'"), e--^^^-')). 

i=0 0<i<j<n-l 

Up to a multiplicative constant C, this is bounded by 

oo oo 2j oo oo 

E^-" + EE»-"+E E 

i=0 i=0 j=i+l i=0 j=2i+l 

We have proved that 

AA - nsQ -Yi^ - k)isk + si) <C. 

e=m / k=l 

Since Yl^l^k + sl\ < oo, this proves fl2.1ip . □ 

3. Probabilistic tools 

3.1. Coupling. As in |BP79] . the notion of coupling is central to our argument. 
In this paragraph, we introduce this notion. 

If Zi : Qi El and ^2:^72—^ E2 are two random variables on two (possibly 
different) probability spaces, a coupling between Zi and Z2 is a way to associate 
those random variables, usually so that this association shows that Zi and Z2 
are close in some suitable sense. Formally, a coupling between Zi and Z2 is a 
probability space fl' together with two random variables Z[ : Q ^ Ei and Z'2 : 



e-^^^-'^ < 00. 



cov 
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Q ^ E2 such that Z'^ is distributed as Zi. Considering the distribution of {Z[, Z'2) 
in El X E2, it follows that one may take without loss of generality Q = Ei x E2, 
with Z[ and Z2 the first and second projection. 

The following lemma, also known as the Berkes-Philipp lemma, is Lemma A.l of 
|BP79] . It makes precise and justifies the intuition that, given a coupling between 
two random variables Zi and Z2, and a coupling between Z2 and another random 
variable Z3, then it is possible to ensure that those couplings live on the same 
probability space, giving a coupling between Zi, Z2 and Z^. 

Lemma 3.1. Let Ei, i = 1,2,3, be separable Banach spaces. Let F be a distri- 
bution on El X E2, and let G be a distribution on E2 x E^ such that the second 
marginal of F equals the first marginal of G. Then there exist a probability space 
and three random variables Zi, Z2, Z3 defined on this space, such that the joint 
distribution of Zi and Z2 is F, and the joint distribution of Z2 and Z3 is G. 

As a typical application of this lemma, assume that two processes (Xi, . . . , X„) 
and (Fi, . . . , F„) are given, and that a good couphng exists between variables X 
and Y distributed respectively like and 'Y^Yi. Then there exists a coupling 

between (Xi, . . . , X^) and (Yi, . . . , F„) realizing this coupling between ^ Xi and 
^ : it is sufficient to build simultaneously 

• the trivial coupling between (Xi,...,X„) and X such that X = 
almost surely. 

• the given coupling between X and Y . 

• the trivial coupling between Y and (Yi, . . . , Yn) such that Y = 'Y^Yi almost 
surely. 

This kind of arguments will be used several times later on, without further details. 

We will need the following lemma. It ensures that, to obtain a coupling with 
good properties between two infinite processes (Zi, Z2, . . . ) and Z'2, . . . ), it is 
sufficient to do so for finite subsequences of these processes. 

Lemma 3.2. Let Un, Vn be two real sequences. Let Zn : Q En and Z'^ : Q' ^ E^ 
(n > 1) be two sequences of random variables, taking values in separable Banach 
spaces. Assume that, for any N, there exists a coupling between {Zi, . . . , Zn) and 
{Z'l,..., Z'j^) with 

(3.1) Pi\Zn-Z'J>Un)<v^ 

for any 1 < n < N. Then there exists a coupling between (Zi, Z2, . . . ) and 
{Z'l, Z'2, . . . ) such that (13. ip holds for any n G N. 

Proof. For all X G N, there exists a probability measure P/v on (-^'i x ■ ■ ■ x Ej^)"^, 
with respective marginals the distributions of {Zi, . . . , Z^) and (Z{, . . . , Z'j^), such 
that P]\f{\zn — z'n\ > Un) < Vn for 1 < n < N, where z„ and z'n denote the 
coordinates in the first and the second En factor. Let us extend arbitrarily this 
measure to a probability measure on E"^, where E = Ei x E2 x . . . . The 
sequence Pn is tight, and any of its weak hmits satisfies the required property. □ 
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3.2. Prokhorov distance. 

Definition 3.3. If P,Q are two probability distributions on a metric space, define 
their Prokhorov distance 7r(P, Q) as the smallest e > such that P{B) < e + Q{B'^) 
for any borelian set B, where B'^ denotes the open e-neighborhood of B. 

This distance makes it possible to construct good couplings, thanks to the fol- 
lowing Strassen-Dudley Theorem [Bil99l Theorem 6.9]: 

Theorem 3.4. Let X, Y be two random variables taking values in a metric space, 
with respective distributions Px and Py- If tt^Px, Py) < c, then there exists a 
coupling between X and Y such that P{d{X, Y) > c) < c. 

We now turn to the estimation of the Prokhorov distance for processes taking 
values in W^. Let d > and > 0. We consider M'^^ with the norm 

\{xi, . . .,xn)\n = sup \Xi\, 

l<i<N 

where \x\ denotes the euclidean norm of a point x G M'^. 

Lemma 3.5. There exists a constant C{d) with the following property. Let F and 
G be two probability distributions on with characteristic functions and 7. 
For any T' > 0, 

N r „ -|l/2 

(3.2) vr(F,G)<5^F(|x,|>T') + (C(ci)T"^/2)^ / \<P - 



Proof. After an approximation argument, we can assume without loss of generality 
that F and G have densities / and g. Then, for any measurable set A, 

F{A) - G{A) < F{A n max \xj\ < T') + F(max \xj\ > T') - G{A n max \xj\ < T') 

r ^ 

< / \f-g\ + J2n\xj\>T'). 

J\xi\,...,\xm\<T' 

Therefore, 7r(F, G) is bounded by the right hand side of this equation. To conclude, 
we have to estimate f, , , , I f — gf|. We have 

/ \f-9\< Wf-gh^ ||i|.,i,...,|.,i<T'L. = ||0-7lL2(CT')'^^/^ 

J\xi\,...,\xm\<T' 

since the Fourier transform is an isometry on up to a factor (27?)^^^/^. This 
concludes the proof. □ 

3.3. Classical tools. Let us recall two classical results of probability theory that 
we will need later on. The first one is Rosenthal's inequality [Ros 70] . and the 
second one is a weak version of Gal-Koksma strong law of large numbers [PS75| 
Theorem Al] that will be sufficient for our purposes. 
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Proposition 3.6. Let Xi, . . . ,X„ he independent centered real random variables, 
and let p > 2. There exists a constant C{p) such that 

V2 / n \ ^IV 



(3.3) 



Proposition 3.7. Let Xi,X2, ... be centered real random variables, and assume 
that, for some q > 1 and some C > 0, for all m, n. 



(3.4) 



E 



m+n—l 



Then, for any a > 0, the sequence Ylj=i-^j/-^'^^'^~^" tends almost surely to 0. 

The following proposition will be used in several forthcoming constructions. 

Proposition 3.8. There exists a symmetric random variable V on M^, belonging 
to L'^ for any q > 1, whose characteristic function is supported in the set {\t\ < e^}. 

Proof. We start from a C°° function supported in {\t\ < eo/2}, and consider its 
inverse Fourier transform / = J^~^{(f)) (which is C°° and rapidly decreasing). Let 
9 = l/P = -^"^0*0) where (f){t) = (p^—t). Let finally h = g/ J g,it is nonnegative, 
has integral 1, and its Fourier transform is proportional to (p-kcj), hence supported 
ill {1^1 < Co}- Let W and W be independent random variables with density /i, 
then V = W — W satisfies the conclusion of the proposition. □ 



4. LP BOUNDS 

Our goal in this section is to show the following bound: 

Proposition 4.1. Let {Aq, Ai, . . . ) be a centered process, bounded in L^ (p > 2) 
and satisfying (jH]). For any t] > 0, there exists C > such that, for all m,n> 0, 

This kind of bound is classical for a large class of weakly dependent sequences. 
The main point of this proposition is that these bounds are established here solely 
under the assumption (jH]) on the characteristic function of the process. 

For the proof, we will approximate the process {Aq, Ai, . . .) by an independent 
process, using ([H]). Estimating the Lp~^ norm of this process thanks to Rosenthal's 
inequality (Proposition 13. 6p . this will yield the desired estimate. 

Lemma 4.2. Let {Aq, Ai, . . .) be a centered process bounded in L^ for some p > 2, 
and satisfying (jH]). LetUn = max^g^ HXl^tn"^ ^^ILa ■ For any a > 0, there exists 
C > such that Ua+b < + Wf, + C(l + a° + 6")(1 + u]/'^ + m^^) for any a,b > 1. 



(4.1) 



m+n— 1 



E -4. 
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^m+a+6— 1 
=m+a+\b°' 



Proof. Let m G N, and a < h. Write Xi = Y.7=Z~ and X2 = E 
Let also Xi = Xi + Vi and X2 = X2 + V2, where Vi and V2 are independent random 
variables distributed like V (constructed in Proposition I3.8p . Let finally Yi and Y2 
be independent random variables, distributed respectively like Xi and X2. 
Let us prove that, for some 6 = 6 (a) > 0, 



(4.2) 



7r((Xi,X2),(Yi,f2)) <Ce 



Let and 7 denote, respectively, the characteristic functions of {Xi,X2) and 
(Yi, Y2) where Yi and Y2 are independent copies of Xi and X2. Since there is a gap 
of size 6" between Xi and X2, ([H]) ensures that, for Fourier parameters at most eo, 
I0-7I < C(l + 6)^e-^*" < Ce-^'''". We have 0-7 = (0 - 7)£;(e^(*i^i+*2^2)). since 
the characteristic function of V is supported in {\t\ < eo}, this shows that the char- 
acteristic functions and 7 of (Xi, X2) and (Yi, l^) satisfy |0 — 7I < Ce'^'^" , and 
are supported in {\t\ < eo}. Applying Lemma [3.51 with N = 2 and T' = e^"^^, we 
obtain f l4.2p (since the first terms in f l3.2p are bounded by E{\Xi\)/T' < Cb/e''°'''^ , 
while the second term is at most CT"^e~'^'^"). 

By (14. 2 p and Theorem 13.41 we can construct a coupling between {Xi,X2) and 
(Yi, 12) such that, outside of a set O of measure at most Ce~* , we have — Fj| < 
Ce~^\ We have 



X1 + X2 



< 



L2 



lo(Xi+X2) 



L2 



lo=(-^i — Yi + X2 — Y2 



L2 



^-1 + ^-2 



L2 



The first term is bounded by ||lo||^q 



LP 



where q is chosen so that + 



X1+X2 

1/q = 1/2. Hence, it is at most Ce~^^/% < C. The second term is bounded by 
Ce'^ < C. Finally, since Yi and Y2 are independent and centered, the last term 
is equal to iE{Y^^) + E{Yi)y/\ 

Since ||V||j;^2 is finite, we finally obtain 



+ X2\\l2 <C + E{Yl) + E{Y? 

Taking into account the missing block 'Yl,t=m+a 
Cb") and using the trivial inequality \\U + V\\l2 < \\U\\l2 + \\V\\l2+2 
we finally obtain 



2j-C + E{Xt) + E{Xi). 
m+a+ [b \ 1 (whose uoTm is at most 

2 ^ llrrl|2 , ||TAI|2 , o llrrll ||t/|| 

L2 II ^ IIl2, 



m+a+fe— 1 


2 


m+a— 1 


2 


■m+a+fe— 1 


2 




< 


E 1' 


+ 


E ^' 




l=m 


L2 




L2 


£=m+a 


L2 



+ C6 



2a 



Cb° 



m+a— 1 



E -4. 

£=m 



+ 



L2 



m+a+6— 1 



E 

£=m+a 



A, 



L2/ 



This proves the lemma. 



□ 
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Lemma 4.3. Let u„ > satisfy 

(4.3) Ua+b <Ua + Uf, + C(l + a" + 6°)(1 + u^/^ + «;/'), 

for all a,b > 1 and some C > 0, a E {0, 1/2). Then Un = 0{n). 

Proof. For any e > and any x, y > 0, we have xy < ex"^ + e~^y'^. We therefore 
obtain from the assumption 

Ua+b < (1 + e){ua + Ub) + Ce"^ max(a2", fe^"). 

Let ffc = maxo<„<2fc+i Un- It follows from the previous equation that 

Vk+i < (2 + 2t)Vk + Ce-^22"^ 

In particular, 



2ak 



(2 + 2e)'=+i - (2 + 2e)^ (2 + 2e)'=+i ' 

It follows inductively that Vk/ (2 + 2e)^ < ^^o + Ce'"^ {2+2e)i+^ ^ Hence, for 
any e > 0, ffc = 0((2 + 2e)'^), i.e., for any p > 1, = ©(n''). Choosing p close 
enough to 1, we get from (14. 3 p Ua+b < + m;, + Ca'^ + Ch^ , for some /3 < 1. 
Therefore, Vk+i < 2^^ + C2'^'^. As above, we deduce that ffe/2'^ is bounded, i.e., 
Un = 0{n). □ 

Proof of Proposition 4-1 ■ Lemmas 14.21 and 14.31 show that a centered process in 
satisfying (jH]) satisfies the following bound in L^: 

m+n—l 



(4,4) 



L2 



Let us now show that the same bound holds in U'"^ for any > 0. 

Let a = 1/10. For n G N, let a = [n^~"\ and b = [n"J. Fix m G N, we 
decompose the interval [m, m + n) as the union of the intervals Ij = [m + ja, m + 
(j + l)a — 6^) and Jj = [m + (j + l)a — b'^,m + {j + l)a) for < j < b, and the 
last interval J = [m + ba,m + n). 

Write Xj = '^i^j^ Ae, and Xj = Xj + Vj where the Vj are independent and 

distributed like V constructed in Proposition 13.81 Finally, let Yq, . . . , Yj,_i be in- 
dependent random variables, such that Yj is distributed like Xj. We claim that, 
for some 6 > 0, for any j < b 

(4.5) 7r((Xo, . . . ,X,_i), (Xo, . . . , X,_2, F,-i)) < Ce-"'. 

Indeed, the Xj are blocks each of length at most n, and there are at most 
blocks. Since there is a gap of length 6^ = n^" between Xj_2 and Xj_i, (jH]) shows 
that the difference between the characteristic functions of the members of (14. 5 p 
is at most Cn*""" ■ e"*^"" " < Ce"'^'" " (the terms Vj ensure that it is sufficient to 
consider Fourier parameters bounded by eo). The estimate (14. 5 p then follows from 
Lemma [3.51 by taking T' = e"" and N = j. 
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Summing over j the estimate in (14. 5p . we obtain 
(4.6) 7r((Xo, . . . , X,_i), (^0, • • • , n-i)) < C7e-"'/2. 



By Strassen-Dudley Theorem 13.41 we can therefore construct a couphng between 
those processes such that, outside of a set O of measure at most Ce~" we have 
\Xi — Yi\ < Ce~" for < i < b — 1. As in the proof of Lemma [4. 2 [ this gives 



b-l 



j=0 



b-l 
j=0 



LP-v 



Since the Yj are independent and centered, Rosenthal's inequahty (Proposition 



13. 6p apphes. Let us write Vk = maxten YllH ^ ■ The Yj are bounded in 



L2 by (by (23])), and in L^-'' by C + Va-b^ < Cva-b'^. Hence, 



LP-v 



b-l 
j=0 



'b-l 



1/2 



'b-l 



l/ip-v) 



p-rj 
fe2 



< ^ Cva-b^b^^^"""^ 



LP-v 



0=0 



vj=0 



Since Xj = Xj + Vj and Vj is bounded by C in L*' we get from the two previous 
equations 



6-1 



j=0 



Finally, 



m+n—l 



< 



LP-v 



b-l 

E^. 

j=0 



+ EEii^ 



m+n—l 



E 



=m+a6 



LP-v 



Therefore, since 3a < 1/2, 



/ip-'i) 



+ Vn-ab- 



Moreover, a < n^^°', h <n°' and n — ah<a + h+ l< Cn^^°'. If Vn = 0{n^'), this 
gives Vn = 0{n'^) for s = s{r) = max(l/2, (1 — a)r + a/{p — rj)). Starting from the 
trivial estimate Vn = 0{n), we get f„ = 0(n**-^-'), then Vn = 0{n'^^'^^^^^) and so on. 
Since p — r] > 2, this gives in finitely many steps f„ = 0(n^/^). □ 

5. Proof of the main theorem for nondegenerate covariance 

matrices 

In this section, we consider a process {Aq, Ai, . . .) satisfying the assumptions of 
Theorem ll.3[ and such that the matrix is nondegenerate. We will prove that 
this process satisfies the conclusions of Theorem 11.31 Replacing without loss of 
generality A^ by Ai — E{A£), we can assume that Ai is centered. If is a finite 
subset of N, we denote its cardinality by \K\. 
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The strategy of the proof is very classicah we subdivide the integers into blocks 
with gaps between them, make the sums over the different blocks independent 
using the gaps and ([H]), use approximation results for sums of independent random 
variables to handle the now independent sums over the different blocks, and finally 
show that the fluctuations inside the blocks and the terms in the gaps do not 
contribute much to the asymptotics. 

The interesting feature of our approach is the choice of the blocks. First, we 
subdivide N into the intervals [2", 2""'"^), and then we cut each of these intervals 
following a triadic Cantor-like approach: we put a relatively large gap in the 
middle, then we put slightly smaller gaps in the middle of each half, and we go on 
in this way. This procedure gives better results than the classical arguments taking 
blocks along a polynomial progression: this would give an error p/{3p — 2) in the 
theorem, while we obtain the better error term p/{^p — 4) with the Cantor-like 
decomposition. The reason is that, to create n manageable blocks, the classical 
arguments requires gaps whose union is of order n^, while the triadic decomposition 
only uses gaps whose union is of order n. 

We will now describe the triadic procedure more precisely. Fix /3 G (0, 1) and 
e e (0, 1-/3). Let / = f{n) = [f3n\. We decompose [2", 2"+^) as a union of F = 2-^ 
intervals {In,j)o<j<F of the same length, and F gaps {Jn,j)o<j<F between them, used 
to give enough independence. Good intervals and gaps are laid alternatively and in 
increasing order, as follows: [2", 2"+-'^) = J„ o U In,o U Jn,i U /„,i ■ ■ ■ U Jn,F-i U In,F-i- 

The lengths of the gaps Jnj are chosen as follows. The middle interval Jn,F/2 
has length 2'-'^'^-l2-^~-'^. It cuts the interval [2", 2""'"-'^) into two parts. The middle 
interval of each of these parts, i.e., Jn,F/A and Jn,3F/4:, have length 2^'''^^2^~'^ . The 
middle intervals of the remaining four parts have length 2^'^"'^2-l'~^, and so on. 
More formally, for 1 < j < F, write j = ^{=oaA:(j)2^ where ak{j) E {0, 1}, and 
consider the smallest number r with ct^O) 7^ 0, then the length of Jnj is 2L'^"-l2^. 
We say that this interval is of rank r. This defines the length of all the intervals 
Jnj, except for j = 0. We let | J„,o| = 2l-'^"-l2-^, and say that this interval has rank 



Since there are 2^ ^ intervals of rank r for r < /, with length 2L^"J2^ the 
lengths of the intervals {Jn,j)o<j<F add up to 



Let \In,j\ = 2""-^ — (/ + 2)2^™-!"-*^, this is a positive integer if n is large enough, and 
Yl \^n,j\ + \ Jn,j\ = 2", i.e., those intervals exactly fill [2"-, 2"+^). We will denote 
by in,j the smallest element of Inj- 

We will use the lexicographical order -< on the set {(n, j) | n G N, < j < F{n)}. 
It can also be described as follows: {n,j) -< {n',j') if the interval is to the left 
of In',f- A sequence {nk,jk) tends to infinity for this order if and only if — )• oo. 



(5.1) 




r=0 
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Let Xnj = J2eei„j for n G N and < j < F{n). Write finally X = [j^ j /„j 
and J' = [Jnj "^nj- The main steps of the proof are the following: 

(1) There exists a coupling between (Xnj) and a sequence of independent 
random variables (Ynj), with Ynj distributed like Xnj, such that, almost 
surely, when (n,j) oo. 



(2) There exists a coupling between (l^„j) and a sequence of independent gauss- 
ian random variables Z„.j, with cov(Z„j) = |/„^j|S^, such that, almost 
surely, when (n,j) — )■ cxd. 



{n',j')^{n,j) 



(3) Coupling the X„ j with the Z„ j thanks to the first two steps and writing 
Znj as the sum of \In,j\ gaussian random variables A/'(0, S^), we obtain 
(using Lemma 13.11 and the example that follows it) a coupling between 
{Ai)£^x and {Bf)^(zx where the are i.i.d. and distributed like A^(0, S^), 
such that, when (n, j) tends to infinity. 



(4) We check that, almost surely, when (n, j) — )■ oo, 



max 

irL<\In,j\ 



E -4' 



o(2 



((l-/3)/2+/3/p+e)n 



)• 



Moreover, a similar estimate holds for the B^s. 
(5) Combining the last two steps, we get when k tends to infinity 



e<k, eel 



(6) Finally, we prove that the gaps can be neglected: almost surely, 
(5.2) Yl A, = o{k^l'^^), 

i<k, l£j 



and a similar estimate holds for the B(^s. 
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Altogether, this gives a couphng for which, almost surely 



E 

e<k 



Af-Bf 



Let us choose (3 so that the two error terms are equal, i.e., (3 = p/{2p — 2). We 
obtain an almost sure invariance principle with error term p/ {Ap — 4) + e, for any 
e > 0. Since the almost sure invariance principle implies the central limit theorem, 
this proves Theorem II. 3^ under the assumption that is nondegenerate. 

It remains to justify the steps (1), (2), (4) and (6), since the steps (3) and (5) 
are trivial. This is done in the next paragraphs. 

5.1. Step (1): Coupling with independent random variables. In this para- 
graph, we justify the first step of the proof of Theorem II. 3^ with the following 
proposition. 

Proposition 5.1. There exists a coupling between {Xnj) and [Ynj) such that, 
almost surely, when {n,j) tends to infinity. 



E 



(n'j')^(nj) 



o(2 



{/3+e)n/2^ 



The rest of this paragraph is devoted to the proof of this proposition. 
Let Vnj, for n,j G N, be independent copies of V (constructed in Proposition 
independent from everything else (we may need to enlarge the space to ensure 
their existence). Let Xnj = Xnj + Vnj. 

We will write X„ = (X„j)o<j<F{n) and X„ = (-^n,j)o<j<F(n)- The proof of 
Proposition 15.11 has two parts: first, we make the different X„ independent from 
each other, using the gaps Jn,o- Then, inside each block X„, we make the variables 
Xn j independent by using the smaller gaps J, 



raj- 



Lemma 5.2. Let Qn be distributed like Xn but independent of {Xi, 
have 



,Xn-i). We 



(5.3) 7r((Xi, 

Proof. The process (Xi, . . . ,X„) takes its values in for D = J2m=i^i'^) — 
X]m=i ^'^"^ < C2^"'. Moreover, each component in M.'^ of this process is one of 
the X„ j, hence it is a sum of at most 2" consecutive variables A£. On the other 
hand, the interval J„^o is a gap between (Xj)j<„ and X„, and its length k is 
(j±i2m+f5n^ Let and 7 : — )■ C denote respectively the characteristic func- 
tions of (Xi, . . . , Xn-i, Xn) and (Xi, . . . , X„_i, Qn), where Qn is a copy of X„, 
independent of (Xi, . . . ,X„_i). The assumption (jH]) ensures that, for Fourier pa- 
rameters tnj all bounded by eo, we have 



7l < C7(l + 2 



n\CD^-ck 



< Ce 
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if n is large enough. 

Let (f) and 7 be the characteristic functions of (Xi, . . . , X„) and (Xi, . . . , Qn): 
they are obtained by multiplying (p and 7 by the characteristic function of V in 
each variable. Since this function is supported in < eg}, we obtain in particular 

(5.4) \^-j\<Ce-'''"^'\ 

We then use Lemma 13.51 with N = D and T' = e^'"''^ to get 

7r((Xi, . . . , (Xi, . . . , Xn-l, Qn)) 

m<n j<F{m) 



e 



The second term is again bounded by £-^'2''"+""^ while each term in the first sum 
is bounded by e~^^"^^ E{\Xrn,j\) < e~^^"^^ ■ C2"'. Summing over m and j, we obtain 
a bound of the form Ce"^*", which is stronger than (15. 3p . □ 

Corollary 5.3. Let = {Rn,j)j<F{n) be distributed like X„, and such that the 
Rn are independent from each other. There exist C > and a coupling between 
(Xi,X2,...) and (i?i, -R2, • • • ) such that, for all {n,j), 

(5.5) P{\Xn,j - RnJ > C4-") < 



Proof. By Lemma [3l2| it is enough to build such a coupling between (Xi, . . . , X^v) 
and (i?i, . . . , Rn) for fixed X (we just have to ensure that the constant C we obtain 
is independent of X, of course). 

We use Lemma 15.21 to get a good coupling that makes X^v independent of the 
other variables, then use again this lemma to make X^v^i independent of the other 
ones, and so on. In the end, this yields the desired coupling between X and R. 

Let us be more formal. For n < X, we denote by (-R", . . . , i?^) a process 
distributed like (Xi, . . . ,X„). Let also i?„ be distributed like X„, independent of 
everything else. For 1 < n < X, Lemma 15.21 and Strassen-Dudley Theorem 13.41 
give a good coupling between (i?", . . . , R!^) and (-R""^, . . . , Rn-l, Rn)- Putting all 
those couplings together on a single space (by Lemma [3.11) . we obtain a space on 
which live in particular {Ri , . . . , R^) and (-Ri, . . . , Rn), which are the processes 
we are trying to couple. Moreover, 



N 



\Rn - Rn\ < ^ l-R^ - -R^ ^1 + l-R^ - -R„|- 
j=n+l 

If iRi-Rf^l < CA-^ for j G [n + l,X] and |K-^n| < <^4-", we get \R^-Rn\ < 
C'4:~"- for some constant C independent of rz and X. In particular, P{\R^ — Rn\ > 
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C'A ") is bounded by 



N N 

J2 - > C4-^) + P{\K -Rn\> C4-") < CA-^ < □ 

j=n+l j=n 

Lemma 5.4. For any n & N, we have 

(5-6) 7r((i?„j)o<j<F(n), (^n,i)o<j<F(ri)) < C4~", 

where Ynj = Ynj + Vnj ■ 

Proof. Let / = f{n) = \_(3n\ and F = 2^ . We will first make the variables 
{Rn,j)j<F/2 independent of the variables {Rn,j)F/2<j<F by using the large gap J„,f/2, 
then proceed in each remaining half using the gap in the middle of this half, and 
so on. 

Formally, we define Y^j, for < i < /, as follows: for < A; < 2-^~\ the random 
variable Y^^^ '■= (^r!jOfe2'<i<(fe+i)2' is distributed like (X„j)fc2><j<(fc+i)2^ and 3^;^ 
is independent from y^ f,, ii k ^ k'. Hence, the process (Y^j)o<j<F coincides with 
(^n,i)o<j<F^ while jY'„°j)o<j<F coincides with {Yn,j)o<j<F- 

Writing F' = {Y^j)j<F, let us estimate Tr(Y\Y^~^) for 1 < i < /. Since the 
variables are already independent from one another for < A; < 2-^^*, we have 

2f-*-l 

(5.7) n{Y\r^-')< J2 ^iKk^iK2lyn-:2Ui))- 

k=0 

Moreover, ^ is made of 2* sums of variables A^s along blocks, each of these 
blocks has length at most 2""-^, and there is a gap J„,a:2»+2»-i of size in 
the middle. Therefore, (jH]) ensures that the difference between the characteristic 
functions of and {y-^l, X ^2k+i) i^ most 

C{1 + 2"-/)^2«e-^2'"^^ < Ce^'^2'-c2-+» < Ce-^'2'"^\ 

if n is large enough. Taking N = 2^ and T' = e^'"''^ in Lemma 13. 5^ we obtain (with 
computations very similar to the proof of Lemma 15. 2p 

^iZk,iZ:2lyn-:2Ui))<ce-''\ 

for some 6 > 0. Summing over k in (15. 7p and then over i, we obtain 

/ 

vr(F^,f°) < 5^7r(F\f*-i) < /2-^Ce-2'" < Ce-^'"/'. 

i=l 

This gives in particular (15. 6p . □ 



Proof of Proposition 15. ii Let us put together the coupling constructed in Corollary 
15.31 with the couplings constructed in Lemma (5.41 for each n, thanks to Strassen- 
Dudley Theorem 13.41 We get a coupling between (X^j) and (Ynj) such that 
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P{\Xn,j - Ynjl > (^4-") < Since Enj4~" < oo, Borel-Cantelli ensures 

that, almost surely, 



(5i 



sup 



< OO. 



Moreover, Xnj = Xnj + Vnj where the random variables Vnj are centered, inde- 
pendent and in L^. By the law of the iterated logarithm, almost surely, for any 
a > 0, 



{n',j')^{n,j) 



o(Card{ (n',j') | {n',f) ^ 



Moreover, Card{(n', j') | {n',f) -< (n, j)} < E"'=i Ej'<f(„') 1 < C2^"- We obtain 
almost surely 



Xn',j' — Xn',j' 

{n',j')'<{n,j) 



o(2 



/3n(l/2+a) 



)• 



A similar estimate holds for Ynj — Ynj. With (15. 8p . this proves the proposition. □ 

5.2. Step (2): Coupling with Gaussian random vectors. We will use Corol- 
lary 3 of |Zai07] ■ let us recall it here for the convenience of the reader in a form 
that is convenient for us (it is obtained from the statement of Zaitsev by taking 
r = 10/e, 7 = g, = ikf, n = b and z' = Mz/5). 

Proposition 5.5. Let Yq, . . . , Yh^i be independent centered W^-valued random vec- 

tors. Let q > 2, and let M = ( 2Jj=o-^l^r ) ■ Assume that there exists a se- 
quence = rriQ < mi < ■ ■ ■ < ntg = b satisfying the following condition. Let 
(k = ^rrife + ■ ■ ■ + ynik+i-i ^'^^ -Bfc = cov ^fc, wc assumc that 



(5.9) 



2l,|2 



imM^\v\^ < {BkV,v) < 100CM^\v 



for all f G M*^ and all < k < s , and for some constant C > 1. Then there exists 
a coupling between {Yq, . . . , Yf,_i) and a sequence of independent gaussian random 
vectors {Sq, . . . , Sh-i) such that cov 5'^ = cov Yj, and moreover 



(5.10) 



P I max 

l<i<b 



j=0 



>Mz]< C'z-" + exp {-C'z) 



for all z > C'\ogn. Here, C is a positive quantity depending only on C, the 
dimension d and the integrability exponent q. 



The following lemma easily follows from the previous proposition. 
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Lemma 5.6. For n E N, there exists a coupling between (F„_o, • • • , ^n,F(n)-i) o.nd 
{Snfi: • • • ) Sn,F{n)-i) where the Snj are independent centered gaussian vectors with 
cov Sn J = covYnj, such that 



(5.11) 



EH. 



max 

<i<F{n) 



j=0 



y 2{(l'l3)/2+P/p+e/2)n 



< OO. 



Proof. Let q G (2,p) and let n E N. We want to apply Proposition 15.51 to the 
independent vectors (^nj)o<j<F, where F = F{n) = 2'-^"-'. 

By Proposition im we have < (:72(^-^)"/2_ jhis implies that M := 

{Yjj=o ^ I I ^ j satisfies 



(5.12) 



By the assumptions of Theorem II. 3 [ covy„j = |J„j|E^ + o(|/„j|") for any a > 0. 
In particular, 



(5.13) 



covy,,, = 2(i-^)"S2(l + o(l)). 



Moreover, we assume that matrix is nondegenerate. Therefore, there exists a 
constant Cq such that, for any < m < m' < F with m' — m large enough, and 
for any vector v, 



I m' — l 



Co^m' - m)2(i-^)"|t;p < / ^ covYr,jV,v ) < Co{m' - m)2(^-^)"| 



For m = and m' = F, the quantity {m' — m)2^^~^^'^ = 2^^"'^ .2^^"^)" is much larger 
than M^, by 05.121) . On the other hand, each individual term (for m' = m + 1) is 
bounded by 



cov y^.^,- 1 1 f I < ||i^ij||^2 



- nj \\Li 



Therefore, we can group the Ynj into consecutive blocks so that (15. 9p is satisfied, 
for some constant C. 

Applying Proposition 15.51 we get a coupling between (F„,0) • • • ? ^n,F-i) and 

{Snfiy • • • ! Sn^F-l) SUch that 



(5.14) 



P I max 

l<i<F 



i-1 
j=0 



> 2'"/^M < C2"«'"/l 



by fl5.10p for z = 2'^"/'^. This quantity is summable in n. Since 2'^"'/^M < 
2((i-/')/2+/3/p+<:/2)n jf g jg close euough to p and n is large enough, this concludes 
the proof of the lemma. □ 
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Lemma 5.7. Let Z^j be independent gaussian random vectors with cov Znj = 
There exists a coupling between (Snj) and {Z^j) such that, almost surely, 

(5.15) Sn',r-Zn',r=0{2'^^^'^-'^). 

in',j')^{n,j) 

Proof. Let a > 0. Let Enj = J\f{0, l/njlS^ + 2""/^), where Id is the identity 
matrix of dimension d. By assumption, covS'^j = |/„j|S^ + o(2""). In particular, 
if n is large enough, we can write |/„j|E^ + 2""'/^ = covZ„j + M„ j where the 
matrix M„ j is positive definite and |M„ j| < C2"". Therefore, Enj is the sum of 
Snj and an independent random variable distributed like A/'(0, Mnj)- In the same 
way, Enj is the sum of Znj and of an independent gaussian M{0, 2""/^). Putting 
those decompositions on a single space thanks to Lemma \37L\ we obtain a coupling 
between (Snj) and (Znj) such that the difference Dnj = Snj — Znj is centered, 
and with IID^jII^^ < C2°"/2. 

We claim that this coupling satisfies the conclusion of the lemma if a < e/2. 
Indeed, by Etemadi's inequality |Bil99t Paragraph M19], we have for any n 



P 



max 

l<i<F(n) 



j-1 



3=0 



> 



2(/3+e/2)n/2 



< C max P 

l<i<F(n) 



< C max E 

l<i<F(n) 




F{n)-1 
j=0 

This is summable. Therefore, almost surely, for large enough n and for 1 < i < 
F(n),wehave Ej=I)^n,j < 2(^+^/2W2_ rpj^g gg^j^^^^g (j5j5j) f^ll^^g^ □ 

Putting together the couplings constructed in Lemmas 15.61 and 15.71 we obtain a 
coupling satisfying the conclusions of Step (2). 

5.3. Step (4): Handling the maxima. We recall that inj is the smallest ele- 
ment of the interval Inj- 



Lemma 5.8. Almost surely, when {n,j) — ?■ oo, 

^(^2((l-/3)/2+/3/p+e)n^_ 



max 

m<\In,i 



in.j+m 

E 1' 
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Proof. Let q G (2,p). In L'^, the partial sums Yli^e=a^e bounded by C(6 — a)^/^, 
by Proposition 14.11 Denote = maxa<„<5 |X]"=a ^^|- Corollary Bl in |Ser70j 
then also shows that 



(5.16) Ml<Cib-a) 



1/2 



for a different constant C. In particular, if z/ = (1 — /3)/2 + + e/2, 
Moreover, 

This sum is finite if q is close enough to p. Borel-Cantelli gives the desired result. 

□ 

5.4. Step (6): The gaps. Recall that J is the union of the gaps J„j. In this 
paragraph, we prove the following lemma. 

Lemma 5.9. For any a > 0, there exists C > such that, for any interval J C N, 



E 



E -4' 



< c\Jnj\ 



l+a 



Together with Gal-Koksma strong law of large numbers (Proposition l3.7l) applied 
with q = l + a, this shows that, for every a > 0, almost surely, 

J2 A = o{\jn[o,kp'+"). 

i<k,£&J 

Moreover, for k e [2",2"+i), we have (by flETD ) 

n F{m)-1 n 

IJ^ n [0, A;] I < ^ I Jmj I < C ^ 7^2^"^+^™ < Cn2^"+^" < Ck^^^''^. 

m=l j=0 m=l 

With the previous equation, we obtain (if a is small enough) 

J2 A, = o{k^'^+^). 

This is (15.21) . as desired. 



Proof of Lemma We will freely use the convexity inequality 
(5.17) (ai + ■ ■ ■ + ttkf < kial + ■■■ + al). 
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Let J C N be an interval. We decompose J fl JT" as JqU JiU J2, where Jq and J2 
are, respectively, the first and the last interval of J H JT, and Ji is the remaining 
part (it is therefore a union of full intervals of J'). Then 



ieJnj 



< 3 



E-4- 

1&.J2 



The set Jq is an interval, hence Proposition 14.11 gives E IXI^gJo^^I — ^I'^oI- A 
similar inequality holds for J2. To conclude the proof, it is therefore sufficient to 
prove that 

2 



(5.18) 



E 



E^' 



<C\J, 



\l + OL 



Since Ji is not an always an interval, this does not follow directly from Proposition 
14.11 However, this is trivial if Ji is empty. Otherwise, let N be such that max Ji G 
[2^,2^"^^). Since the last interval in Ji is contained in [2^,2^+-^), its length is 
2VeN\+r gQ^g ^ g f{N)]. In particular, < Clog | Ji|. 

We defined the notion of rank of an interval J„j in the paragraph before Equa- 
tion (15. ip : such an interval has rank r G [0, /(n)] if its length is 2^'^'^^^'^. There 
are 2^'^'^^~^~'^ intervals of rank r in [2", 2""^^) for r < /(n), and one interval of rank 
f{n). 

For G N and < r < /(n), let denote the union of the intervals 

Jnj which are of rank r. The number of sets jT"^"''"-' intersecting Ji is at most 
X]^=o(/(^) + -'-) — CN"^. Hence, by the convexity inequality (I5.17p . 

2 



(5.19) 




2 


E ^' 











Let us fix some (n, r), and let us enumerate the intervals of j/'*-"'''-' as i^i, i^t 

f{n)). Let Ts = ^^g^ A^. We claim 



for t = 2^(")-i-^ if r < f{n) (or t = 1 if r 
that, for any subset S" of {1, ... , t}, 

2 



(5.20) 



E 



s&S 



< Y,E\Ts\^ + C\S\. 



s&S 



Let us show how this concludes the proof. By Proposition 14. H we have E\Ts\'^ < 
C\Ks\. Therefore, for any set K which is a union of intervals in j/'*-"''^-', we obtain 
E\ 'I2eeK^i\'^ — ^\^\- This applies in particular to i^' = Ji fl J'^"'^\ Therefore, 
f lETgj) gives 



E 



< CiV^^lJiH = CA^Vl|• 



Together with the inequality N < C log | Ji | , this proves (I5.18P as desired. 
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It remains to prove fl5.20p . We first make the Tg independent, as follows. Let 
{Ui, . . . , Ut) be independent random variables, such that Us is distributed like Tg. 
Let also Vi, . . . ,Vt, V(, . . . he independent random variables distributed like V 
(constructed in Proposition 13.81) and write Tg = Tg + Vg, Ug = Ug + Vg. We claim 
that, for some 6 > and C > 0, 

(5.21) nm,...,f,),{U„...,U,))<Ce-''\ 

To prove this estimate, we use the intervals of rank > r as gaps: we first make 
Ti,...,Tt/2 independent of Tt/2+i, • • • , using the gap Jn,F/2, then proceed in 
each half using the central gaps Jn,F/i and Jn,zF/A^ and so on. The details of the 
argument are exactly the same as in the proof of Lemma 15. 4[ 

Thanks to this estimate and Strassen-Dudley Theorem 13.41 we can construct a 
coupling between (Tj)i<j<t and {Uj)i<j<t such that, outside of a set O of measure 
at most Ce~^ we have \Tj — Uj\ < Ce~^ " for 1 < j < t. For any subset S of 
{1, . . . , t}, we obtain (as in the proof of Lemma | 



< 



L2 



ses 



+ 



L2 



lo= > Tg 



Ug 



+ 



L2 



L2 



The first term is bounded by ||1 



0\\li 



J2ses'^s\\ 7 where q is chosen so that 1/p- 



l/q = 1/2. Hence, it is at most Ce ^ "/i2^ < C. The second term is bounded by 



Cte- 



< C. We obtain 



E^. 


< 


E^. 


+ 


Ek 


<c+ 




+ 


E"- 


+ 


Ek' 


s€S 


L2 


ses 


L2 




L2 


ses 


L2 


ses 


L2 


ses 



L2 



Ug 



IL2 



The Ug are centered independent random variables, therefore ||X]sg5' 
iEEmf' = {Y.E{T^)f"- In the same way, WZsesH^^ 
C\S\^'^. We get \\T.sesTs\\L2 < C + {T.E{T^)f^^ + C\S\^'^, which implies 
fl5^ . □ 



6. Completing the proof of the main theorems 

In this section, we first finish the proof of Theorem 11.31 when the matrix T? is 
degenerate, and then we derive Theorem 11.21 from Theorem 11.31 

Lemma 6.1. Let {Aq, Ai, . . .) he a process satisfying the assumptions of Theo- 
rem \1.3\ for = 0. Then, almost surely, Yl^=o — o{n^) for any \ > p/ (4j9 — 4). 

Proof. Let /3 > and e > 0. Define a sequence of intervals In = [n'^'^^, (n + l)'^+^)n 
N, and denote by i„ 
surely, 

-1 



(6.1) 



|"n^+^] the smallest element of /„. We claim that, almost 

in-l 



1=0 
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and 



(6.2) 



max 

i€l„ 



0{n 



/3/2+l/p+eN 



Taking (3 = {p — 2)/p to equate the error terms, we get |^^<;;^^| = 0(n^/^"'"''), 
where n = n{k) is the index of the interval /„ containing k. Since n < Ck^^^^^^\ 
we finally obtain an error term 0(/c^"''^'*"^) for 

, _ 1 1 _ P 

2 ' 1 + (p-2)/p 4p-4' 

This concludes the proof. It remains to establish (16.1 p and (16. 2p . 

By (OD, llEto^ ^^11 = 0(71^/2) for any a > 0. Therefore, 

II IIl^ 

2 



i=0 



a /^l+2e 



L2 



Taking a = e, this quantity is summable. (16. ip follows. 
Denote = maxa<„<b |X]"=a ^^|- -^'-'^ q < p, we have 



max 



> W 



/3/2+1/p+e 



< 



Li 



Li 



< C{in+i — in)^^'^ < C-nPI"^ . Therefore, the last equation is 



BydLlHD, M^" 

bounded by C /n'^'^^/^^'^\ This is summable if q is close enough to p. The estimate 
(1^ follows. □ 

Let (Aq, v4i, . . . ) be a process satisfying the assumptions of Theorem 11.31 for 
some matrix T?. Replacing by Ai — E{Ai), we can assume that this process is 
centered. We decompose as an orthogonal sum E (B F, where is nondegen- 
erate on E, and vanishes on F. The almost sure invariance principle along E is 
proved in Section [5l while Lemma 16.11 handles F. This proves Theorem 11.31 

Finally, Theorem 11.21 directly follows from Lemma 12.71 and Theorem 11.31 

References 

[AWOO] Jon Aaronson and Benjamin Weiss, Remarks on the tightness of cocycles, CoUoq. Math. 

84/85 (2000), 363-376, Dedicated to the memory of Anzelm Iwanik. MR1784202, 
Cited page [71 

[BGK07] Jean-Baptiste Bardet, Sebastien Gouezel, and Gerhard Keller, Limit theorems for cou- 
pled interval maps, Stoch. Dyn. 7 (2007), no. 1, 17-36. MR2293068, Cited pageEl 

[Bil99] Patrick Billingsley, Convergence of probability measures, second ed., Wiley Series in 
Probability and Statistics: Probability and Statistics, John Wiley & Sons Inc., New 
York, 1999, A Wiley-Interscience Pubhcation. MRl 700749 , Cited pages [H and [H 



INVARIANCE PRINCIPLE VIA SPECTRAL METHODS 



31 



[BP79] Istvan Berkes and Walter Philipp, Approximation theorems for independent and weakly 

dependent random vectors, Ann. Probab. 7 (1979), no. 1, 29-54. MR515811, Cited 

pages [Illllllll andm 
[BT07] Viviane Baladi and Masato Tsujii, Anisotropic Holder and Sobolev spaces for hyperbolic 

diffeomorphisms, Ann. Inst. Fourier (Grenoble) 57 (2007), no. 1, 127-154. MR2313087, 

Cited pageO 

[DP84] Manfred Denker and Walter Philipp, Approximation by Brownian motion for Gibbs 
measures and flows under a function, Ergodic Theory Dynam. Systems 4 (1984), no. 4, 
541-552. MR779712, Cited pageffl 

[Ein89] Uwe Einmahl, Extensions of results of Komlos, Major, and Tusnddy to the multivariate 
case, J. Multivariate Anal. 28 (1989), no. 1, 20-68. MR996984, Cited pages [D and H 

[FMT03] Michael Field, Ian Melbourne, and Andrew Torok, Decay of correlations, central limit 
theorems and approximation by Brownian motion for compact Lie group extensions, 
Ergodic Theory Dynam. Systems 23 (2003), no. 1, 87-110. MR1971198, Cited page|3l 

[GH88] Yves Guivarc'h and Jean Hardy, Theoremes limites pour une classe de chaines de 
Markov et applications aux diffeomorphismes dAnosov, Ann. Inst. H. Poincare Probab. 
Statist. 24 (1988), no. 1, 73-98. MR937957, Cited pageH 

[GL08] Sebastien Gouezel and Carlangelo Liverani, Compact locally maximal hyperbolic sets 
for smooth maps: fine statistical properties, J. Differential Geom. 79 (2008), no. 3, 
433-477. MR2433929, Cited pageli 

[Gou08] Sebastien Gouezel, Characterization of weak convergence of Birkhoff sums for Cibbs- 
Markov maps. Preprint, 2008. Cited page [51 

[HK82] Franz Hofbauer and Gerhard Keller, Ergodic properties of invariant measures for piece- 
wise monotonic transformations. Math. Z. 180 (1982), no. 1, 119-140. IMR656227I 
Cited pages [1] and m 

[HP08] Loic Herve and Frangoise Pene, Nagaev method via Keller- Liverani theorem, Preprint, 
2008. Cited pagelH 

[KL99] Gerhard Keller and Carlangelo Liverani, Stability of the spectrum for transfer operators, 
Ann. ScuolaNorm. Sup. Pisa CI. Sci. (4) 28 (1999), no. 1, 141-152. MR1679080, Cited 
page El 

[Liv03] Carlangelo Liverani, Invariant measures and their properties. A functional analytic 
point of view, Dynamical systems. Part II, Pubbl. Cent. Ric. Mat. Ennio Giorgi, Scuola 
Norm. Sup., Pisa, 2003, pp. 185-237. MR2071MI Cited pageH 

[MN09] Ian Melbourne and Matthew Nicol, A vector-valued almost sure invariance principle 
for hyperbolic dynamical systems, Ann. Probab. 37 (2009), 478-505. Cited pages[Tl[6l 
andE 

[MT02] Ian Melbourne and Andrew Torok, Central limit theorems and invariance principles 
for time-one maps of hyperbolic flows. Comm. Math. Phys. 229 (2002), no. 1, 57-71. 
MR1917674, Cited page El 

[MT04] Ian Melbourne and Andrei Torok, Statistical limit theorems for suspension flows, Israel 
J. Math. 144 (2004), 191-209. MR2121540, Cited pagelS 

[PP90] William Parry and Mark PoUicott, Zeta functions and the periodic orbit structure of 
hyperbolic dynamics, Asterisque (1990), no. 187-188, 268. MR1085356, Cited page El 

[PS75] Walter Philipp and William Stout, Almost sure invariance principles for partial sums 
of weakly dependent random variables, Mem. Amer. Math. Soc. 2 (1975), no. issue 2, 
161, iv+140. MR0433597, Cited page[ll 

[Ros70] Haskell P. Rosenthal, On the subspaces of L^ [p > 2) spanned by sequences of inde- 
pendent random variables, Israel J. Math. 8 (1970), 273-303. MR 0271721i Cited page 

m 



INVARIANCE PRINCIPLE VIA SPECTRAL METHODS 



32 



[SauOO] Benoit Saussol, Absolutely continuous invariant measures for multidimensional expand- 
ing maps, Israel J. Math. 116 (2000), 223-248. MRl 759406, Cited page El 

[Ser70] Robert J. Serfling, Moment inequalities for the maximum cumulative sum, Ann. Math. 
Statist. 41 (1970), 1227-1234. MR0268938, Cited pagedH 

[Tsu08] Masato Tsujii, Quasi-compactness of transfer operators for contact Anosov flows. 
Preprint, 2008. Cited page HI 

[Zai98] Andrei Yu. Zaitsev, Multidimensional version of the results of Komlos, Major and 
Tusnddy for vectors with finite exponential moments, ESAIM Probab. Statist. 2 (1998), 
41-108 (electronic). MR1616527, Cited pages □ and H 

[Zai07] , Estimates for the rate of strong approximation in the multidimensional invari- 

ance principle, J. Math. Sci. (N. Y.) 145 (2007), 4856-4865. , MR2355400 , Cited pages 
Eland [Ml 



IRMAR, CNRS UMR 6625, Universite de Rennes 1, 35042 Rennes, France 
E-mail address: sebastien.gouezel@univ-rennesl.fr 



